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Abstract 

We consider the distributed function computation problem in asymmetric communication scenarios, where the sink computes 
some deterministic function of the data split among N correlated informants. The distributed function computation problem is 
addressed as a generalization of distributed source coding (DSC) problem. We are mainly interested in minimizing the number of 
informant bits required, in the worst-case, to allow the sink to exactly compute the function. We provide a constructive solution 
for this in terms of an interactive communication protocol and prove its optimality. The proposed protocol also allows us to 
compute the worst-case achievable rate-region for the computation of any function. We define two classes of functions: lossy and 
lossless. We show that, in general, the lossy functions can be computed at the sink with fewer number of informant bits than the 
DSC problem, while computation of the lossless functions requires as many informant bits as the DSC problem. 

I. Introduction 

Let us consider a distributed function computation scenario, where a sink node is interested in exactly computing some 
deterministic function / = f(X) of data-vector X that is split among N correlated informants. The correlation in informants' 
data is modeled by discrete and finite distribution V , known only to the sink (asymmetric communication, [1]). The sink and 
CZ2 , informants interactively communicate with each other, with communication proceeding in rounds, as in [2]. We are concerned 
with minimizing the number of bits that the informants send, in the worst-case, to allow the sink to compute the function. 

We consider the distributed function computation problem as a generalization of distributed source coding (DSC) proble 
The particular distributed function computation problem we consider is a generalization of DSC problem in asymmetric 
communication scenarios, we addressed in [3]. As for that work, the motivation for this work too comes from sensor networks, 
particularly from our efforts to address the distributed function computation problem in single-hop data-gathering wireless sensor 
^] ] networks, while maximizing the worst-case operational lifetime of the network. In a typical data-gathering sensor network, it 
is reasonable to assume that the base-station has large resources of energy, computation, and communication as well as the 
knowledge of correlations in sensor data, whereas a sensor node is resource limited and only knows its sampled data-values. 

■ Therefore, we argue that in such communication scenarios, the onus should be on the base-station to bear most of the burden 
Q\ ] of computation and communication associated with function computation. Allowing interactive communication between the 

■ base-station and sensor nodes lets us precisely do this: base-station forms and communicates efficient queries to sensor nodes, 
J> \ which they respond to with short and easily computable messages. This reduces the communication and computation effort at 

k>( ' sensor nodes, hence enhancing their lifetime, which in turn leads to increased network lifetime. 

r> | The distributed function computation problem was first addressed by Yao in [2] and later by other researchers in different 

■ setups, as we discuss in Section [TTJ However, our work mainly differs from the extant work in one or more aspects as follows. 
First, we approach the distributed function computation problem as a generalization of DSC problem. This allows us to exploit 
the correlation in informants' data to solve the function computation problem at the sink with fewer informant bits. Second, 
we are concerned with asymmetric communication (only sink knows the joint distribution of informants' data) and asymmetric 
computation (only sink computes the function). Third, we are concerned with the worst-case analysis. Fourth, we are interested 
in distributed function computation with a single instance of data at informants (one-shot computation problem). Finally, we 
consider a more powerful model of communication where the sink and informants interactively communicate with each other. 
Our work allows us to clearly delineate the roles played in optimally solving distributed function computation problem in 
arbitrary networks by correlation in informants' data, the properties of the function to be computed, communication model, 
network connectivity graph, and routing strategies. In this sense, our work acts as a fundamental building block to a general 
theory of distributed function computation over arbitrary networks, which we expect to eventually develop. 

In Section Hill we revisit the notion of information ambiguity, an information measure we proposed in [4] for the worst-case 
information-theoretic analyses, and extend it to a form useful in the present context. In Section IIV1 we provide the details 
of the communication model we assume and formally introduce the variant of distributed function computation problem we 
address in this paper. In the next section, we give a communication protocol to compute any given function at the sink, prove 
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DSC problem is a special case of distributed function computation problem where the function to be computed is identity map, id-^F. 



its optimality with respect to minimizing the number of informant bits, and provide the bounds on its performance. Finally in 
Section [VI] we discuss some properties of distributed function computation problem and propose a classification scheme for 
functions, based on the number of informant bits required, in general, to compute those at the sink. 

II. Related Work 

There are three major existing approaches to address the distributed function computation problem, as follows: 

Communication complexity: The seminal paper by Yao [2] introduced the problem of computing the minimum number of bits 
exchanged between two processors when both the processors compute a function of the input that is split between processors. 
Variants of this problem and numerous solution approaches have been explored in the field of communication complexity, [1]. 
This work provides insights into developing efficient communication protocols for function computation. However, it is mainly 
interested in estimating the order-of-magnitude of the bounds on communication and computation costs. Also, it is not obvious 
how to extend this work, when for example, one or more nodes in the network are interested in computing some function of 
source nodes' data or the source data is split among more than two nodes and is possibly correlated. 

Scaling laws: Recently in [5]-[7], the distributed function computation problem has been addressed to find how the rate 
of function computation scales with network size. This approach however does not provide a simple framework to exploit 
the correlation in source data and to incorporate stronger models of computation and communication, such as interactive 
communication, data-buffers, cooperating sources. 

Information theory: Much before Yao introduced his formulation of distributed function computation problem, Slepian and 
Wolf in [8] introduced the DSC problem. It was many years before distributed function computation problem was seriously 
addressed in information-theoretic setup, [9]— [12]. Still, there is very little such work that comprehensively addresses the 
distributed function computation problem over any given network, function, and model of communication and computation. 

III. Information Ambiguity for Distributed Function Computation 

We revise and generalize some relevant definitions and properties of information ambiguity, an information measure we 
introduced in [4] for performing the worst-case information-theoretic analysis in certain communication scenarios. We then 
extend the notion of information ambiguity to a form useful for distributed function computation in this paper. 

Note: All the logarithms used in this paper are to the base 2, unless explicitly mentioned otherwise. 

Let us consider a iV-tuple of random variables {X\, . . . , X^) ~ V = p(x\, . . . , x^), Xi € X, i S {1, . . . , N}, where X is 
discrete and finite alphabet of size \X\. The support set of (Xi, . . . , Xjy) is defined as: 

def 

Sx lt ...,x N = {{xi, ■ ■ .,x N )\p(x!,. . .,X N ) > 0} (1) 

We also call Sx! x N as the ambiguity set of {X\, . . . , Xjy), The cardinality of Sx lt ...,x N is called ambiguity of (X±, . . . , Xn) 

and denoted as Hx lt ...,X N = \Sxt x N \- So, the minimum number of bits required to describe an element of Sxi,,,,,x N > m 

the worst-case, is [log/iXi,...,Xivl- 

The support set Sxi of X i: i E {1, . . . , N}, is the set 

def def 

Sxi = {xi ■ for some X- h (x- l ,x l ) <E S Xl ,...,x N }, with X-i = {x 1: . .. ,x N } \ x t (2) 

of all possible Xi values. We also call Sxi ambiguity set of X,. The ambiguity of Xi is defined as /j,Xi = \Sxi |- The conditional 
ambiguity set of (X\, . . . , Xn), when random variable Xi takes the value Xi, Xi € Sxi, is 

def 

Sxx,...,X N \xMi) = {{x\, ■ ■ - ,x N ) : {X!,, . .,x N ) <E Sx u ...,x N and Xi <E S Xl }, (3) 

the set of possible (Xl,...,Xjv) values when Xi = Xi. The conditional ambiguity in that case is Hx 1 ....,x N \x i i x i) = 
\Sxx x N \Xi ( x i)\> the number of possible values of (Xi, . . . ,Xjy) when Xi = Xi. The maximum conditional ambiguity of 
(Xi, . . -,X N ) is 

V-x u ...,x N \x z = sup{^ Xl! ._ Xjv | X .(x l ) : Xi £ S x ,}, (4) 

the maximum number of (X\, . . . , Xn) values possible with any value that Xi can take. 

In fact, for any two subsets Xa and Xb of \X\, . . . , X^}, such that Xa U Xb C {A^i, . . . , X^} and Xa f~l Xb = <t>, 
we can define for example, ambiguity set Sx A of Xa, conditional ambiguity set Sx a \x b {xb) of Xa given the set xb of 
values that Xb can take, and maximum conditional ambiguity set Sx A \x B °f ^a for any set of values that Xb can take, 
with corresponding ambiguity, conditional ambiguity, and maximum conditional ambiguity given by /ix 4 , l i x A \x B ( x b), and 
fix A \x B > respectively. However, for the sake of brevity, we do not develop the precise definitions of these quantities here. 

Further, let us represent each of fiXi values that random variable Xi can take in [log nxt 1 bits as b\ . . . 6j- log -, . Let 
binary^Xi) represent the value of j th , 1 < j < [log/itx^l* bit-location in the bit-representation of Xi. Then, knowing that the 
value of j th bit-location is b, b £ {0, 1}, we can define the set of possible values that Xj can take as 

def 

Sxilb'W = i x i '■ x i e s Xi and binary^Xj) = b}, (5) 



TABLE I 

Notation used frequently in the paper 



N 
X 
V 

Xi 

s Xi 

S Xl ,...,x N 
Sx lt ...,x N \i 

Sf 
s f\i 

#f 

#DSC 



number of informants 

discrete and finite alphabet set of cardinality | X \ 

TV-dimensional discrete probability distribution, V = p(x\, . . . , Xjy), xi £ X 
random variable observed by i' h informant. Xi 6 X 

ambiguity set at the sink of i th informant's data, with corresponding ambiguity fix = \Sx \ 

ambiguity set at the sink of all informants' data, with corresponding ambiguity /L4jc, ...,Xn = I &X i ... x N \ 

conditional ambiguity set at the sink of all informant's data, when sink has information /, with corresponding 

conditional ambiguity ii Xi I = Sx ( The exact nature of / will be obvious from the context 

ambiguity set at the sink of the output values of function /, with corresponding ambiguity fif = \Sf\ 

conditional ambiguity set at the sink of the output values of function / when sink has information /, with corresponding 

conditional ambiguity = >S7|.rl 

minimum number of informant bits required in the worst-case to compute the function / at the sink 
minimum number of informant bits required in the worst-case to solve the DSC problem at the sink 



with corresponding cardinality denoted as Hx-wity- We can similarly define Sv. \h* (b) with Xi G Xa as 

z 3 ' 3 

def 

s x A \b)(b) = {x A ■ x A G S Xa and binary^;) = b}, (6) 

with corresponding cardinality denoted as Hxaw- (b)- The definitions of conditional ambiguity sets in ([5]) and (O can be easily 
extended to the situations where the values of one or more bit-locations in one or more random variable's bit-representation 
are known, but once more for the sake of brevity, we omit the details of such extended definitions. 

Next, we introduce the notion of the ambiguity set and ambiguity of the function output values. The support-set of output 
values of some function /, also called ambiguity set of function output values of function /, is defined as: 

def 

Sf = {f(xi,...,x N ) : for some (xi, ... ,x N ) G S Xl ....,x N } (7) 

The cardinality of Sf is called ambiguity of output values of function / and denoted as fj,f = \Sf\. So, the minimum number 
of bits required to describe an element in Sf is [log/i/]. The conditional ambiguity set of function output values when 

Xi = Xi,Xi G Sxt >i G {1, . . . , ./V}, is defined as 

Sf\Xii%i) = {f{x x , . . .,x N ) : for some {x u .. .,x N ) G S Xu ...,x N \xA x i)} ( 8 ) 

The corresponding cardinality is called conditional ambiguity of function output values when Xi = Xi and denoted as jU/ix, i x i)- 
We can further define the maximum conditional ambiguity of function output values as 

V>f\Xi sup{^ / | X .(x i ) : Xi G S Xz } (9) 

maximum number of function output values possible over any value that Xi can take over Sx t ■ The definitions in ([8]) and (O 
can be similarly extended to the situations where the conditioning is carried out over a subset Xa of {Xi, . . . , Xjy}. We omit 
the discussion of such extensions here. 

Further, when the value of j th bit-location in the binary-representation of Xi, Xi G Sx*, is known, that is 6* = b 1 b G {0, 1}, 
we can define corresponding conditional ambiguity set of function output values as follows 

def 

s f\b'(b) = {/(xi, . . -,x N ) : (x 1; . . -,x N ) G S Xl ,...,x N and x t G S Xi \bi(b)}, (10) 

3 J 

with corresponding cardinality denoted as fj,fi b i (b). 

If the function / is defined for every Xa,Xa C {Xi, . . . ,Xn}, then for a given support-set Sx lt ...,x N of data-vectors, 
the functional [log///] is a valid information measure as it satisfies various axioms of such measures, such as expansibility, 
monotonicity, symmetry, subadditivity, and additivity, [13]. We omit the details of proof for the sake of brevity . 

In the Table U we summarize the notation used frequently in this section and in the rest of the paper. 

IV. Distributed Function Computation in Asymmetric Communication Scenarios 

Let us consider a distributed function computation scenario, where a sink computes some function of the data of N correlated 
informants. We assume the asymmetric communication, where the joint distribution V of informants' data is known only to 
the sink. The Figure [TJ depicts this scenario for N = 2. 

Problem Statement: A sample X = [x\, . . . ,xn) is drawn Ltd. from a discrete and finite distribution V over N binary 
strings, as in [14], [15]. The strings of X are revealed to N informants, with the string Xi, i G {1, . . . , N}, being given to the i th 
informant. The sink wants to exactly compute a deterministic function / = f(X) of informants' data X (one-shot computation 
problem). Our objective is to minimize the total number of informant bits required, in the worst-case, to accomplish this. 

The Problem Setting: We consider an asymmetric communication scenario [1]. Communication takes place over N binary, 
error-free channels, where each channel connects an informant with the sink. An informant and the sink can interactively 
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Fig. 1. Distributed function computation problem for two informants in asymmetric communication scenarios. 

communicate over the channel connecting them by exchanging messages (finite sequences of bits determined by agreed 
upon, deterministic protocol). The informants cannot communicate directly with each other, though. We assume that the 
communication between the sink and the informants proceeds in rounds, as in [2]. In each round, depending on the information 
held by the communicators, one or other communicator may send the first message. However, we assume, as in [16], that in 
each communication round, first the sink communicates to the informants and then, the informants respond with their messages. 
Each bit communicated over any channel is counted, as either a sink bit if sent by the sink or an informant bit if sent by an 
informant. 

We assume the informants to be memoryless in the sense that they do not remember the messages they send in different 
rounds. We assume that i th informant knows its support-set Sx t , so that it represents the binary string Xi, given to it, as 

fe l--- 5 h°gMx,l 111 riogMA'J bits. 

The sink knows the distribution V and the corresponding support-sets: Sx lr ..,x N of data-vectors and Sf of function output 
values. So, every Sxi,...,Xnt can be uniquely described using \log [iXi,...,Xn~\ bits and every f(X) can be uniquely 

described using [log /if] bits. This implies that to compute f(X) unambiguously, the sink must receive at least [log/z/] bits 
from the informants, in the worst-case. 

For the design and analysis of efficient communication protocols for distributed function computation, we develop a problem- 
encoding scheme as follows. Every informant data-vector X,X E Sxi,...,Xn> can a ^ so De uniquely described by concatenating 
the bit-representations of all corresponding Xi, 1 < i < N. That is, X can be represented at the receiver by ^2i=i RoSAO^] bits 
long representation, constructed by concatenating i th informant's [log/iX;] bit-representation of Xi, for each i 6 {1, . . . , N}. 
With this encoding scheme, our distributed function computation problem reduces to minimizing the number #/ of bit-locations 
in the concatenated bit-representation of X, whose values the sink needs to exactly compute f(X). It should be noted that 
trivially, [log/i/] < #/ < E^Li f"logMx,l- 

We illustrate this problem-encoding scheme with an example support-set in Figure [2] Let the informants 1 and 2 observe 
two correlated random variables Xi and X 2 , respectively, with (Xi,X 2 ) derived from the support-set in first column. Let the 
function / being computed at the sink be 'bitwise OR' of the instance of (X\, X 2 ) revealed to the informants. For the given 
support-set, at least \\ognx 1 .x 2 ~\ = 4 bits are required to describe any element of Sx-l,x 2 an d at le a st [log/i/] = 3 bits are 
required to describe any element of Sf. Also, to individually describe any value assumed by X\ and X 2 , it requires 3 bits. 

For any given support-set of data-vectors, sink that knows the joint distribution V, can construct a problem-encoding as 
in Figure [2] It knows that one string, hitherto unknown, from the fourth column is drawn, with first [log nxi ] bits given to 
informant 1, next [log/ix 2 ] bits given to informant 2, and so on. We require the sink to exactly evaluate the given function 
/ on this string, whose different parts are held by different informants, with the informants sending minimum total number of 
bits to the sink. 

Note on the terminology: We call a bit-location in the bit-string at an informant (as well as in the bit-representation of X 
in encoding scheme defined above) defined, if the sink knows its value unambiguously, otherwise it is called undefined. For 
example, until the sink learns of the actual X revealed to the informants, one or more bits in the AOf;] bits long 

representation of X, remain undefined. Similarly, a bit-location in the bit-representation of the output of the function / is 
called evaluated if the sink can unambiguously compute its value based on the values of one or more bits in informant strings. 

V. Communication Protocol for Distributed Function Computation 

We address the distributed function computation problem, introduced in the last section, in bit-serial communication scenarios, 
where in each communication round, only one informant can send only one bit to the sink. This is an example of scenarios 
where communication takes place over a channel with uplink throughput constrained to one bit per channel use. Our interest in 
this communication model stems from it allowing us to compute the minimum number of informant bits (total and individual) 
required to compute f(X) at the sink when any number of rounds and sink bits can be used. In other words, this communication 
scenario enables us to compute the worst-case achievable rate-region for this problem, as we show later in this section. 

We provide a constructive solution of the distributed function computation problem of the last section, based on interactive 
communication. The proposed protocol optimally solves this problem and computes the worst-case achievable rate-region. We 
call the proposed protocol "bit-serial function Computation (bSerfComp)" protocol and describe it next. 
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S f = {000, 001,010,011, 110} 
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Elements of S 

(1,1) 
■" (1,3) 
(2,2) 
(2,4) 
(3,1) 
(3,3) 
(3,5) 
(4,2) 
(4,4) 
(5,3) 
(b) 



X 1' X 2 



Binary Representation 

( 000, 000 ) 
( 000, 010 ) 
(001,001 ) 
(001,011 ) 
(010, 000) 
(010,010) 
(010, 100) 
(011,001 ) 
(011,011 ) 
( 100,010) 
(O 



Concatenated Binary 
Representation 

000000 
000010 
001001 
001011 
010000 
010010 
010100 
011001 
011011 
100010 
(d) 

= 10, = Vx 2 



Function Output for 
'bitwise OR' 



000 
010 
001 
011 
010 
010 
110 
011 
011 
110 
(e) 

5 (b) the members of 



Fig. 2. Example of problem encoding: (a) Support-sets: Sxi,X 2 , ftfi , Sx 2 > and Sf with nx 1 ,X 2 = lOj/WEi = / J -x 2 = 5,/U/ 

Sxx,x 2 (c) binary representation of members of Sx 1 X 2 (d) the concatenated binary representation. If the string '000010' is drawn, then '000' is given to 
informant 1 and '010' is given to informant 2. (e) Function output values corresponding to X for 'bitwise OR'. 

A. The bSerfComp protocol 

In bSerfComp protocol, in each communication round only one bit is sent by the informant chosen to communicate with 
the sink. The chosen bit has the property that it divides the size of the current conditional ambiguity set of function output 
values, at the sink, closest to halo Formally, in terms of the problem statement and encoding introduced in the last section, if 
U is the set of undefined bits in J2iLi ["^OSM-X^l bits long representation of X, then the bit chosen in ^ th , I > 0, round is the 
one that solves argmin Jg(7 max^^g/o^} M/ife^-v The sink, after receiving the value of the chosen bit, recomputes the set of 
undefined bits U. This is carried out iteratively till all bits in [log/i/] bits long representation of f(X) are not evaluated. 



Algorithm: bSerfComp 



Xi,...,Xi\ 



1 1 = 

2 Let S l Xi Xn = S Xl ,...,x N 

3 Let S l f = S f , (i f = | 

4 Let V = {l,...,EiIi flog Mil} _ 

5 Let U be the set of undefined bits in V, U C V, over all X E S l 

6 while (yUj > 1) 

7 K l+1 = argmin jer/ max fc(i)e{0 , 1} ^ /|b(j) 

8 Choose the bit-location corresponding to k l+1 , where k l+1 is a randomly chosen element of K l+1 

9 The sink asks the informant corresponding to bit-location k l+1 to send the bit-value b(k l+1 ) 

10 Set Sxi,...,X N = S Xl ,...,X N \b(k l + 1 ) 

1 1 Set < ?' +1 — 

12 Compute U C V, the set of undefined bits 

13 1 = 1 + 1 

The sink can perform the worst-case performance analysis of the bSerfComp protocol by selecting on the line [9] b*(k l+1 ) 
that solves: 

b*(k l+1 ) = argmax//,| 6(fc(+1 s =s 

s={0,l} 

Note that there are two versions of the bSerfComp protocol: in the online version, the sequence of queries from the sink 
to the informants is determined adaptively depending on the informant response in the previous rounds, while in the offline 
version, for a given support-set of data-vectors the entire sequence of queries is determined before actual querying starts. For 
example, the sequence of queries generated for the worst-case analysis of the protocol corresponds to the offline version. 



For n-ary representation of data-values, this will be 1/n. 



B. Optimality of bSerfComp protocol 

The binary representations of the elements of Sf, as in Figure [2]e, can be arranged as the leaves of a binary tree, where 
ambiguity set of function output values Sf forms the root and conditional ambiguity sets of function output values form 
internal nodes and leaves. The set of function output values corresponding to a child node is obtained by conditioning the set 
of function output values corresponding to its parent node on the value b, b € {0, 1} of by. j th bit-location in the binary string 
revealed to i th informant, with '6 = 0' leading to the left subtree and '6=1' leading to the right subtree. Such a binary tree 
with Hf leaves will have a minimum-height of [log/x/], implying that at least [log^/] bits are required to describe any leaf, 
in the worst-case. 

Lemma 1: bSerfComp protocol computes all minimum-height binary trees corresponding to the given support-set to exactly 
evaluate a given function /. 

Proof: Follows from the definition of minimum-height binary trees and the description of bSerfComp protocol. ■ 

Lemma 2: bSerfComp protocol computes bi, the minimum number of bits that the i th ,i <E {1, . . . , N}, informant must 
send to let the sink exactly evaluate the function /. 

Proof: The bSerfComp protocol exploits the bit-serial communication scenario where a bit queried from the chosen 
informant maximally conditions the resultant ambiguity set of function output values at the sink. Also, to reduce the number 
of bits that an informant sends, the bSerfComp protocol can procrastinate querying the bits from the concerned informant 
until it can be postponed no more, thus maximally reducing the number of bits an informant sends. Combining these two 
observations, proves the lemma. ■ 

Lemma 3: For a given support-set, each corner point of the worst-case achievable rate-region for computing function / 
corresponds to at least one minimum-height binary tree, with height 

Proof: For the sake of contradiction, let us assume that there is a corner point of the worst-case achievable rate-region to 
which no minimum-height binary tree corresponds to. This implies that this corner point is outside the worst-case rate-region 
defined by the set of all the corner points visited by the set of minimum-height binary trees. This further implies that at this 
corner point at least one informant, say i th , sends fewer bits than 6; (defined in the statement of Lemma [2] above). However, 
this contradicts the definition of bi, that it is the minimum number of bits i th informant needs to send to let the sink exactly 
evaluate the function /. Thus, there cannot be any corner point outside the rate-region defined by the set of corner points 
corresponding to the set of all minimum-height binary trees, hence proving the lemma. ■ 

Theorem 1: For a given support-set, bSerfComp protocol computes the worst-case achievable rate-region for function /. 
Proof: Combining the statements of Lemmas Q] and [3] we can state that bSerfComp protocol computes each corner 
point of the worst-case achievable rate-region. Thus, bSerfComp protocol computes the worst-case achievable rate-region for 
computing function /. ■ 

The worst-case achievable rate-region for distributed computation of function / in asymmetric communication scenarios is 
given by the following corollary to Theorem Q] For the sake of notational simplicity, we state it only for N = 2. 

Corollary 1: For N = 2, if bi denotes the minimum number of bits that an informant i, 1 < i < 2, sends over all solutions 
of bSerfComp protocol and #y denotes the total number of bits sent by all informants, then the worst-case achievable rate 
region is given by: 
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Proof: The proof follows from the worst-case optimality of bSerfComp protocol proven in Theorem Q] ■ 
In Figures 04] using bSerfComp protocol, we compute the worst-case achievable rate-regions for functions: 'bitwise OR', 
'bitwise AND', and 'bitwise XOR', evaluated at sink over two support-sets of data-vectors for two correlated informant^ 

C. Performance bounds for bSerfComp protocol 

To compute the bounds on the performance of bSerfComp protocol, we make use of an interesting and important observation 
regarding the working of bSerfComp protocol to compute a given function / at the sink for a given support-set of data-vectors. 

Observation: A bit-location in the bit-representation of the function output values can be evaluated, without all bit-locations 
in the concatenated bit-representation of X being defined. 

Let #/ and #_dsc denote the minimum number of informant bits required, in the worst-case, to evaluate the function / 
and to solve the DSC problem, respectively, at the sink for a given support-set of data-vectors. 

3 In computer programming literature, it is well-know how to compute the bitwise functions over two binary strings, [17]. Let B(Xi, Xj) denote the output 
of the bitwise function B evaluated over binary-strings corresponding to Xi and Xj, i.j £ {1, . . . , N}, i ^ j. Define B(Xi) = Xi. Then, the evaluation 
of B over any number N, N > 2, of arguments can be recursively defined, for example, as: B(Xi, . . . , Xjy) = B(B(Xi, . . . , Xjv— l), X^). 
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Fig. 3. Distributed function computation - 1: support-set of data-vectors (a) and worst-case achievable rate-regions for 'bitwise OR' in (b), for 'bitwise AND' 
in (c), and for 'bitwise XOR' in (d) 
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Fig. 4. Distributed function computation - II: support-set of data-vectors (a) and worst-case achievable rate-regions for 'bitwise OR' in (b), for 'bitwise 
AND' in (c), and for 'bitwise XOR' in (d) 

Loose Bounds: As we discussed before, #/ is bounded from below by [log fj, f\ , that is 

riog/i/i < #/ 

Let /j, s be the number of data-vectors or informant strings which evaluate to the function output value s,s e Sf. Also, let 
us define [i™ m = min s gs, \l s . Then, assuming that the function / evaluates to the output that corresponds to /i™ m , we obtain 
a trivial but useful upper bound on #j as: 

#/ <#dsc- [logMr"! 
Therefore, combining above two bounds on #/, we can say that loosely satisfies: 

[log fXf] < #/ < # DSC - [log ft?™] (11) 

Tight bounds: For a given support-set of data- vectors, [log fj, /] is the lower bound on minimum number of informant bits 
required to evaluate the output of function /. Let us assume that the sink has obtained [log /i f\ informant bits using bSerfComp 
protocol. Let assume that the size of conditional ambiguity set of data-vectors at the end of I th round, 1 < I < [log///~|, is 
l/2 1_£i of its size at the beg inning of this round. Define e m ax — max{ei, . . . , epiog^j] }■ Then, the size of conditional ambiguity 
set of data- vectors after [log /j, /] informant bits are received satisfies: 

HXi,...,x N ^ MXj 



/i (1 _ ei) 



< 



2 (i- 



Now, if 



< 



2(l-Cmax)riogM/l 

then, the function output evaluation finishes with [log/i/] to [log/x/] + [log/x™ m ] informant bits. So, we have 

fiogA*/i < #/ < riog/i/i + [io g /ir m i 

and in this case the lower bound in (fTTT i is actually tight. 

HXt,...,Xt 



(12) 



Otherwise, that is, if 



2 (i- 



)flogfi/| 



then, the function computation finishes in |~log/^/] + [log/j*] to [log/i/] + log a x ' j ' p f " M f i 
of smallest subset of function output values that satisfies 

Hx u ...,x„ 



bits, where /j* is the size 



2U- 



< 



|S|= M * 



Therefore, in this case we have: 



flognf] + flog pi*] < #/ < flogM/1 + lo S^(T 



^x x ,...,x^ 



VI. Some Properties of Distributed Function Computation 



(13) 



We discuss some of the significant results, properties, and observations based on our work on distributed function computation 
problem in asymmetric communication scenarios. 

A. Two Classes of Functions 

Let us consider two deterministic functions g = maxjXi,^} and h = X* 2 . For two or more data- vectors derived from 
any discrete and finite support-set, the function g may evaluate to same output value. On the other hand, the function h assigns, 
in general, a unique output value to each of its input pairs. Generalizing this to the functions of N, N > 2, variables computed 
over corresponding discrete and and finite support-sets, there are various functions whose behavior is either like function g or 
like function h above. 

The common statistical functions, such as 'max', 'min', 'majority, 'mean', 'median', and 'mode' and logical functions, such 
as 'parity', 'bitwise OR', 'bitwise AND', and 'bitwise XOR' belong to a class of functions, which we call lossy functions. 
Similarly, the functions such as 'identity function', 'iterated exponentiation', Xie Xj belong to a class of functions, which 
we call lossless functions. Formally, for the lossy functions the cardinality of their range is smaller than the cardinality of their 
domain, while for the lossless functions two cardinalities are equal. In fact, it is easy to prove that the equality of the sizes of 
domain and range of a function is an equivalence relation and classes of lossy and lossless functions are equivalence classes. 

The reason these two equivalence classes of functions are relevant in the discussion of distributed function computation is 
that, in general, the computation of lossy functions at the sink requires fewer number of informant bits than computation of 
lossless functions. As DSC belongs to the equivalence class of lossless functions (DSC is distributed function computation with 
function to be computed being the identity map: id-^), this implies that, in general, for a given support-set the computation of 
lossless functions requires as many informant bits, in the worst-case, as the solution of DSC problem, while the computation 
of lossy functions requires fewer number of informant bits than DSC. 

The bSerfComp protocol of the last section can be used to compute both, the lossy and lossless functions. However, as 
for the lossless functions, the bSerfComp protocol reduces to much simpler bSerCom protocol of [3] for computing DSC 
in the corresponding communication scenario, the latter can be deployed at the sink for their computation in asymmetric 
communication scenarios. 

Also, for the lossless functions the knowledge of function output allows us to uniquely determine the input data-vector 
revealed to the informants (reversible function computation), while for the lossy functions this is not possible (irreversible 
function computation). This apparent loss of information accompanying the computation of lossy functions results in their 
computation with fewer number of informant bits, but at the cost of sacrificing our ability to recover the input data-vector 
from their output. For the lossless functions, there is no such information loss in their computation, allowing the unambiguous 
recovery of the input data- vectors from their output, but at the cost of larger number of informant bits. 

TABLE II 

Comparison of lossy and lossless function computation 



Lossy Functions 



Lossless Functions 



1. Examples: various common statistical and bitwise functions 

2. Range of the function is smaller than its domain 

3. Complex bSerfComp protocol is used for computation 

4. The worst-case rate-region is larger than DSC 

5. The sink cannot unambiguously recover the data- vector revealed 
to the informants from function output value 



1. Examples: DSC, iterated exponentiation 

2. Range of the function is of same size as its domain 

3. Simple bSerCom of [3] is used for computation 

4. The worst-case rate-region coincides with DSC 

5. The sink can unambiguously recover the data-vector revealed 
to the informants from function output value 



It should be noted that above classification of functions holds true for any given support-set of data-vectors, in general. 
However, one can always concoct exceptions where the cardinality of the support-set of function output values for some lossy 
function is same as the cardinality of the corresponding support-set of data-vectors. Similarly, some exception for lossless 
functions can be constructed, where the cardinality of the support-set of function output values is smaller than the cardinality 
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Fig. 5. 'max' computation for support-set in (a) requires #/ = 4 informant bits, (b) requires #^ = 3 informant bits, (c) requires # j = 4 informant bits, 
and (d) requires #/ = 3 informant bits 
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Fig. 6. 'max' computation for support-set in (a) requires # j = 6 informant bits, (b) requires #y = or no informant bits, (c) requires # f = 2 informant 
bits, and (d) requires #y = 2 informant bits 

of corresponding support-set of data- vectors. We state without proof that the number of such instances of support-sets is small 
for any given cardinality of the support-sets. Further, in all situations the following lemma always holds for any function /. 

Lemma 4: If [log^/] = [log^DScl, then #/ = #dsc- Also, if \\og Li f ~\ < \\og li D sc~\, then #/ < #dsc- 

Proof: Omitted for brevity. ■ 

This brings us to relating our work on function classification with Han and Kobayashi's work along similar lines, [9]. We 
establish two equivalence classes of functions: lossy and lossless. Given that DSC problem belongs to the class of lossless 
functions, the worst-case achievable rate-region of lossless functions coincides with the worst-case rate-region of DSC problem, 
while for lossy functions it is correspondingly larger. In [9] too, the authors have introduced such dichotomy of functions of 
correlated sources: for one class of functions the achievable rate-region coincides with Slepian-Wolf rate-region and for another 
class it does not. However, in spite of apparent similarities in results, there are some basic differences. First, we are interested 
in the worst-case information-theoretic analysis while authors in [9] are concerned with average-case analysis. Second, our 
results pertain to one-shot function computation, while [9] deploys block-encoding. 

It is interesting to ask if for a given communication scenario, we can always construct two or more equivalence classes 
of functions based on the communication cost of their computation. This appears to be a largely unexplored problem and a 
systematic answer that also unifies various previous attempts to classify functions based on their communication costs, as in 
[5], [9], and this paper, warrants our attention. Further, proposed two classes of the functions can be further refined based on 
other finer details of the functions and we actually expect the classification structure to be richer than just the dichotomous 
classification in the paper. As of now, our own work in these directions is in preliminary stage and we propose to address 
these issues comprehensively in the near future. 

B. Dependence of on [if and lldsc 

In subsection IV-CI we established how for a given support-set of data-vectors the minimum number of informant bits 
needed to compute the function / in the worst-case, depends on \if, the ambiguity of function output values, and /-Lx 1 ,....x N , 
the ambiguity of data-vectors. Now, let us consider how for a given function /, #f for two different support-set of data-vectors 
depends on corresponding [if and ^x 1 ,....x N - 

Let ix f and fi^ denote the cardinality of the set of function output values for first and second support-set, respectively. 

Let iijjgc and [i 2 DS q denote the cardinality of the set of data-vectors for first and second support-set, respectively. 



Finally, let and denote the minimum number of informant bits required to compute the function / for first and 
second support-set, respectively. 

In this subsection, we state without proof the relation between #1 and #j, given the relations between p}f and /A, and pj^sc 
and p? DSC . We provide an exhaustive list of various possibilities and provide an example for each when the sink computes 
max{Xi, X2] over data values of two informants for a given support-set. 

Property 1: p^ = pj, p D g C = pjjsc' iff an d iff mav or ma y not De equal, for example the support-sets in Figures [3] (a)-(b) 
for which 7^ iff- l n this case, it is clear that ambiguities corresponding to function output values and data- vectors alone 
cannot be used to establish the relation between #y and This deficiency of the notion of ambiguity is addressed in greater 
detail in one of our related papers, [18]. 

Property 2: p^ = p'j, pjjsc ^ iAdsc = ^ > iff an d $7 follow the ordering of p DSC and p 2 DSC - An illustration of this 
case is given by the support-sets in Figures |5](c)-(d). 

Property 3: p\ 7^ pA, p D sc = I^dsc == ' > iff an d ifj follow the ordering of p}^ and pi. Figures [6] (a)-(b) illustrate this. 

Property 4: p\ < p\, iAdsc < ^"bsc Of — H 2 f- Support-sets in Figures |6](c) and[5](c) illustrate this. 

Property 5: p\ < pA, ppsc > l^bsc = ^ iPf — iff - Support-sets in figures |6](d) and|5](d) illustrate this. 

VII. Conclusions and Future Work 

We address the distributed function computation problem in asymmetric and interactive communication scenarios, where 
the sink is interested in computing some deterministic function of input data that is split among TV correlated informants and 
is derived from some discrete and finite distribution. We consider the distributed function computation as a generalization of 
distributed source coding problem. We are mainly interested in computing # f, the minimum number of informant bits required 
in the worst-case, to allow the sink to exactly compute the given function. We provide bSerfComp protocol to optimally compute 
the functions at sink for any given support-set of data-vectors and prove it computes the worst-case achievable rate-region for 
computing any given function, illustrating this with examples. Also, we provide a set of bounds on the performance of the 
proposed protocol. 

We define two equivalence classes of functions: lossy and lossless. We show that the lossy functions can be computed, in 
general, with fewer number of informant bits than lossless function, such as DSC. Further, we establish the dependence of 
#f, when the function / is computed over two different support-sets, on their respective ambiguities of function output values 
and data-vectors. 

In future, we want to extend this work in three interesting directions. First, in this paper we have assumed that the sink and 
informants directly communicate with each other. Allowing the sink and informants to indirectly communicate with each other 
over one or more intermediate nodes (as in multihop networks), offers many more opportunities of reducing the number of bits 
carried over the network to compute a function at the sink. Second, allowing the sink to tolerate certain amount of error in the 
computation of the function may reduce the number of informant bits required. We want to address these directions formally 
in our setup. Finally, we want to come up with a generic framework to classify the functions based on the communication 
costs of their computation over arbitrary networks with any given model of communication and computation. 
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